Nondeterministic Finite Automata in Hardware - the Case of the Levenshtein Automaton

نویسندگان

  • Tommy Tracy
  • Mircea Stan
  • Nathan Brunelle
  • Jack Wadden
  • Ke Wang
  • Kevin Skadron
  • Gabriel Robins
چکیده

The Levenshtein Nondeterministic Finite state Automaton (NFA) recognizes input strings within a set edit distance of a configured pattern in linear time. This automaton can be pipelined to recognize all substrings of an input text in linear time with additional use of nondeterminism. In general, von Neumann hardware cannot directly execute NFAs without significant time or space overhead. A von Neumann simulation of the Levenshtein automaton incurs exponential run time overhead in the general case. A common technique to avoid the simulation overhead is to convert the pipelined NFA to a DFA, but at the expense of heavy pre-computation and high space overhead. In this paper, we introduce a novel technique for executing a pipelined Levenshtein NFA using Micron’s Automata Processor (AP), avoiding the run time and space overheads associated with CPU and GPU implementations. We show that run time remains linear with the input while the space requirement of the automaton becomes linear in the product of the configured pattern length and edit distance. These properties allow the AP to execute large instances of the Levenshtein NFA or many small instances in parallel thus making the automaton a viable building block for future approximate string applications on the AP.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dynamic Programming for Reduced NFAs for Approximate String and Sequence Matching

We present a new simulation method for the reduced nondeterministic nite automata (NFAs) for the approximate string and sequence matching using the Levenshtein and generalized Levenshtein distances. These reduced NFAs are used in case that we are interested only in all occurrences of a pattern in an input text such that the edit distance between the pattern and the found strings is less or equa...

متن کامل

Nondeterministic soliton automata with a single external vertex

Soliton automata are the mathematical models of certain possible molecular switching devices called soliton circuits. Both from theoretical and practical point of view, it is a central question to describe soliton automata with a single external vertex. Extending a result of Dassow and Jürgensen, we characterize nondeterministic automata in this special case.

متن کامل

Subset Encoding: Increasing Pattern Density for Finite Automata

Micron’s Automata Processor is an innovative reconfigurable hardware accelerator for parallel finite-automatabased regular-expression matching. While the Automata Processor has demonstrated potential for many pattern matching applications, other applications receive reduced benefit from the architecture due to capacity limitations or routing limitations. In this paper, we present an efficient i...

متن کامل

Ambiguity of Unary Symmetric Difference NFAs

Okhotin [9] showed an exponential trade-off in the conversion from nondeterministic unary finite automata to unambiguous nondeterministic unary finite automata. In this paper, we consider the trade-off in the case of unary symmetric difference finite automata to finitely ambiguous unary symmetric difference finite automata. Surprisingly, the trade-off is linear in the number of states of the fi...

متن کامل

Simulation of NFA in Approximate String and Sequence Matching

We present detailed description of simulation of nondeterministic nite automata (NFA) for approximate string matching. This simulation uses bit parallelism and used algorithm is called Shift-Or algorithm. Using knowledge of simulation of NFA by Shift-Or algorithm we design modi cation of ShiftOr algorithm for approximate string matching using generalized Levenshtein distance and modi cation for...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015